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1 GCCATGGTGG GGCAGAGGTT GGGAAGATGG CGTGGCGAGG CTGGGCGCAG 
51 AGAGGCTGGG GCTGCGGCCA GGCGTGGGGT GCGTCGGTGG GCGGCCGCAG 
101 CTGCGAGGAG CTCACTGCGG TCCTAACCCC GCCGCAGCTC CTCGGACGCA 
151 GGTTTAACTT CTTTATTCAA CAAAAATGCG GATTCAGAAA AGCACCCAGG 
201 AAGGTTGAAC CTCGAAGATC AGACCCAGGG ACAAGTGGTG AAGCATACAA 
251 GAGAAGTGCT TTGATtCCTC CTGTGGAAGA AACAGTCTTT TATCCTTCTC 
301 CCTATCCTAT AAGGAGTCTC ATAAAACCTT TAI 1 1 I I IAC TGTTGGGTTT 
351 ACAGGCTGTG CATTTGGATC AGCTGCTATT TGGCAATATG AATCACTGAA 
401 ATCCAGGGTC CAGAGTTATT TTGATGGTAT AAAAGCTGAT TGGTTGGATA 
451 GCATAAGACC ACAAAAAGAA GGAGACTTCA GAAAGGAGAT TAACAAGTGG 
501 TGGAATAACC TAAGTGATGG CCAGCGGACT GTGACAGGTA TTATAGCTGC 
551 AAATGTCCTT GTATTCTGTT TATGGAGAGT ACCTTCTCTG CAGCGGACAA 
601 TGATCAGATA TTTCACATCG AATCCAGCCT CAAGTGTTAT TTCCAATTTT 
651 GTCAGTTACG TGGGTAAAGT TGCCACAGGA AGATATGGAC CATCACTTGG 
701 TGCATCTGGT GCCATCATGA CAGTCCTCGC AGCTGTCTGC ACTAAGATCC 
751 CAGAAGGGAG GCTTGCCATT ATTTTCCTTC CGATGTTCAC GTTCACAGCA 
H 801 GGGAATGCCC TGAAAGCCAT TATCGCCATG GATACAGCAG GAATGATCCT 
Q 851 GGGATGGAAA I I I I I IGATC ATGCGGCACA TCTTGGGGGA GCTCTTTTTG 
|; 3 901 GAATATGGTA TGTTACTTAC GGTCATGAAC TGATTTGGAA GAACAGGGAG 
951 CCGCTAGTGA AAATCTGGCA TGAAATAAGG ACTAATGGCC CCAAAAAAGG 
1001 AGGTGGCTCT AAGTAAAACT GGGATTGGAC AGTAGTGGTG CATCTGGTCC 
1051 TTGCCGCCTG AGAGCCCCAG GAGACATCGG CTAGAGTGAC CATGGCTATG 



fy 1101 CTCCCGTCTG GAAGATGCCA GCATCTGGCC TCCCACTGTT TTCAGCTGTG 
1151 TCCCCCAGTC CGTGTCTTTT TAGAATGTGA ATGATGATAA AGTTGTGAAA 
i«& 1201 TAAAGGTTTC TATCTAGTTT GTAAAAAAAA AAAAAAAAAA AAAAAAA (SEQ ID NO:l) 

I'U 

if FEATURES: 
£ S'UTR: 1-26 
H Start Codon: 27 
i! Stop Codon: 1014 
3'UTR: 1017 

Homologous proteins: 

gi 1 110662 50 1 gb | AAG28519 . 1 1 AF197937JL (AF197937) preseni 1 i ns i nt . . . 668 0.0 

gi 189241341 ref I NP_061092.1 1 hypothetical protein PRO2207 [Homo ... 264 le-69 

gi 1 7303544 1 gb | AAF58598.il (AE003824) CG8972 gene product [Droso. . . 186 4e-46 

gi 1 321992 5 1 sp 1 014364 1 YB4J_SCHP0 HYPOTHETICAL 33 . 6 KD PROTEIN C3 . . . 69 le-10 

gi 1 6321538 1 ref | NP_011615 . 1 1 YgrlOlwp [Saccharomyces cerevi si ae] . . . 64 3e-09 
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EST: 

gi I 10216540 /dataset=dbest /taxon=96.. 
gi 1 10215044 /dataset=dbest /taxon=96.. 
gi I 10212049 /dataset=dbest /taxon=96.. 
gi I 10154606 /dataset=dbest /taxon=96.. 
gi 1 9141009 /dataset=dbest /taxon=9606. 
gi 19338606 /dataset=dbest /taxon=960. . 
gi 19720819 /dataset=dbest /taxon=960. . 
gi 1 5857747 /dataset=dbest /taxon=9606 
gi I 10813749 /dataset=dbest /taxon=960. 
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EXPRESSION INFORMATION FOR MODULATORY USE: 
gi I 10216540 Lung 

gi I 10215044 Lung small cell carcinoma 

gi I 10212049 Lung small cell carcinoma 

gi I 10154606 Ovary adenocarcinoma 

gi 1 9141009 Lung 

gi 19338606 Uterus endometrium 

gi 1 9720819 Lymph Burkitt lymphoma 

gi 1 5857747 Colon 

gi I 10813749 Dendritic cells 

Tissue Expression: 
Human leukocytes 
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1 MAWRGWAQRG WGCGQAWGAS VGGRSCEELT AVLTPPQLLG RRFNFFIQQK 
51 CGFRKAPRKV EPRRSDPGTS GEAYKRSALI PPVEETVFYP SPYPIRSLIK 
101 PLFFTVGFTG CAFGSAAIWQ YESLKSRVQS YFDGIKADWL DSIRPQKEGD 
151 FRKEINKWWN NLSDGQRTVT GIIAANVLVF CLWRVPSLQR TMRYFTSNP 
201 ASSVISNFVS YVGKVATGRY GPSLGASGAI MTVLAAVCTK IPEGRLAIIF 
251 LPMFTFTAGN ALKAIIAMDT AGMILGWKFF DHAAHLGGAL FGIWYVTYGH 
301 ELIWKNREPL VKIWHEIRTN GPKKGGGSK (SEQ ID NO: 2) 

FEATURES: 

Functional domains and key regions: 
Prosite results: 

[1] PDOC00001 PS00001 ASNJGLYCOSYLATION 
N-glycosylation site 

161-164 NLSD 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 3 

1 123-125 SLK 

2 142-144 SIR 

3 217-219 TGR 



[3] PDOC00006 PS00006 CK2_PH0SPH0_3ITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 25-28 SCEE 

2 69-72 TSGE 

3 130-133 SYFD 



[4] PDOC00008 PS00008 MYRISTYL 
N-my ri stoyl ati on si te 

Number of matches: 10 

1 12-17 GCGQAW 

2 14-19 GQAWGA 

3 18-23 GASVGG 

4 22-27 GGRSCE 

5 110-115 GCAFGS 

6 171-176 GIIAAN 

7 225-230 GASGAI 

8 228-233 GAIMTV 
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9 272-277 GMILGW 
10 288-293 GALFGI 



[5] PDOC00009 PS00009 AMIDATTON 
Amidation site 

39-42 LGRR 

Membrane spanning structure and domains: 



Helix Begin 


End 


Score Certainity 


1 


107 


127 


1.825 Certain 


2 


173 


193 


1.069 Certain 


3 


226 


246 


1.654 Certain 


4 


250 


270 


1.382 Certain 


5 


288 


308 


1.123 Certain 



BLAST Alignment to Top Hit: 
Alignment to top blast hit: 

>gi 1 11066250 |gb|AAG28519.1|AFl97937_J. (AF197937) presenilis 
interacting rhomboid-like protease [Homo sapiens] 
Length = 379 

Score = 668 bits (1706), Expect = 0.0 

identities = 327/379 (86%), Positives = 328/379 (86%), Gaps = 50/379 (1390 
Frame = +3 

Query: 27 MAWRGWAQRQrVG^ 206 

MAWRGWAQRGWGCGQAWGASVGGRSCEEL^ 
Sbjct: 1 MAWRGWAQRGWGCGQAWGASVGGRSCEELTAV^ 60 

Query: 207 EPRRSDPGTSGEAYKRSALIPFVEETVFYFSPYra 386 

EPRRSDPGTSGEAYKRSALIPPVEEWFYPSPYPIRSLIKPLFFTVGF^ 
Sbjct: 61 EPRRSDPGT5GEAYKRSALIPFVEETVFYPSPYPIRSLIKPLFFTVGFTGCAR3SAAIV\^ 120 

Query: 387 YESLKSRVQSYHXZKADWLDSIRP^ 566 

YESLKSRVQSYFBGIKADlAiUJSI^ 
Sbjct: 121 YESLKSRVQSYFlX;iKADWLi)SIRPQKEGDFW 180 

Query: 567 CLWRVPSLQRTMIRYFTSNPAS 632 

CLWRVPSLQRTMIRYFTSNPAS 
Sbjct: 181 CLWRVPSLQRTMIRYFT5NPASKVLCSPMLLSTFSHFS 240 

Query: 633 SVISNFVSWGKVATGRYGPSLGASGAIMWUV^VCTKIPEGRL^IIF 776 

VISNFVSY4GKVATGRYGPSLGASGAIWTVl^VCTKIPEGRLAIIF 
Sbjct: 241 GQEQFMAWLSAGVISNFVSYLGKVATGRYGPSLGASGAI^ 300 
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Query: 777 LFWFTFTAGNALKAIiyVClTAG^ 956 

LPMRFTAGNALKAIIAIvDTAGMILG/y/KFFDHAA^ 
Sbjct: 301 LPMFTFTAGNALKAIIANCTAG!^ 360 

Query: 957 VKIWHEIRTNGPKKGGGSK 1013 

VKIWHEIRTNGPKKGGGSK 
Sbjct: 361 VKIWHEIRTNGPKKGGGSK 379 (SEQ ID NO:4) 

Hmmer search results (Pfam): 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



PF01694 Rhomboid family 23.3 1.8e-05 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hrrni-t score E-value 



PF01694 1/1 201 292 .. 59 147 . . 23.3 1.8e-05 
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1 CGAGGTTTCT TCATGTTGGT 
51 GATCCGTCCG CCTCAGCCTC 
101 CCACCGCACC CGGCCTTTAT 
151 TTTTAATCAC TTTATCCAGA 
201 GCCTGTGGTT TCCAGAAGCT 
251 GTTGCCCATG GAACTGACAG 
301 GCCTAGTAGG CAGGATCAGG 
351 ACAATTAGAA TAAAACACCA 
401 CGATCTTTAT GATCATGGCC 
451 ATATCAATCA CTGGGTATGA 
501 GTAAAGATGA TTAACTCTCC 
551 GAATCACTGC TCTCTTCCTA 
601 GTGAGGAGGC TCTGCCCGCC 
651 GGGAGGTGTA CACAGTGCTC 
701 TTTTCCCATC TTTGCATAAA 
751 TCTGGGTAAT TATTTACCAA 
M 801 AGTGAGACAT CTGATACAAC 
Q 851 AATTTTATTT AATGTACTAC 
H 901 ATGCATCCTG ATTTCAGATA 
** 951 GAAAAATGTA GCTTCCTTTC 
"5 1001 ACAGACTAGA AATGCCAGGG 
q 1051 TGCGGAAGCC ACAGGTAACA 
I ; y 1101 ACAAAGAATG TTTTCTAAAG 
1151 CAGTCCTTAT TGGCATCGAA 
M 1201 ACAGAGACTG GAGGAATGAC 
!'U 1251 AGAGGAATAC CTGCTATGTG 
1301 TAI 1 1 I I IGA AGTTTGTAGT 
;£ 1351 ACAAGTTAAT TATATTATCG 
1401 ATCAACACTT ACAGAAAAAG 
1451 ACTTTAGAAG CAGTTGCAGA 
1501 ACATGACTAT TTTTCTCAGA 
1551 ACATATGTTT CCATAAGCTG 
1601 GAGGAAAAAG AAGTAATGTT 
1651 TTCTGTATAT TACTTCTGTC 
1701 TTGATCTGAT TATAATTGAG 
1751 TGGATCTAGG GAAAGGAAGT 
1801 AATTCTGTGA CTTTACCAAC 
1851 CATTTCACTT GO 1 1 1 1 1 1 1 
1901 ACTCTTACAT AATTGTGGAA 
1951 CTTTCCTGCT TTCAI 1 1 1 IA 
2001 AGGTATTATA GCTGCAAATG 
2051 CTCTGCAGCG GACAATGATC 
2101 AAGTCTAACT TGTGTGAATT 
2151 TATGCTTTAG TTAATGGAAG 
2201 AGCTACAAGC AAAATGCAGA 
2251 AGGGACCTCA CCTCTCTTTT 



CAGGCTGGTC TCGAACTCCC GACCTCAGGT 
CCAAAGTACT GCTGGGATTA CAGACGTGAG 
CTTTCATTTT TTTTCATGTA TTTTCCTTTA 
AACATATCCT CGTCTTGACA GTGCTGTGGT 
GGGTGTGCTG TGTGTCTGTG GTTTGAGGAA 
AGGAAGCAGA GTAGTCGTTG CCA 1 1 1 I ICA 
GACCCCATCT TGCTCTCTTT GCCTTGAACC 
AAGCCCTGAC TGATCATGAT CATAGCAATC 
AGACCATTCT CAGGTCGTCT TTACCCTAAG 
CAACCTAGAC CTAAGGGTGC ACTCTGGGTA 
CAAAGGAATC TAAGGAATCC AGAGCAACAC 
TAGGGTAAAC CTCCCAAGAC TCCAGTCCCT 
TGCCCTTCCC AGGGTTCCAG GCTCCACATT 
TTCGCTCTTC ATTGCCTTGT GTATGATCCC 
TGCTGTCCCT CTCACCATCT TTAAAAGAGT 
AGGTGGTATA ATGCTGTCAC AGTCCCTGCT 
TGATGGAATC AGTTCAACAA AATGCAGTAA 
GGAGAAAGAA AAAATGCTAC CAGTTATAAG 
TTAAAATGGA AAAAATGTCT TAAGATCTGT 
CCACCTCTCA AGTGGGAGAG CAAAAACTGG 
GCTAGCTGAG AACCTTACAG AATGAGCAAC 
CCGAGATGTA GATCAGCTGC CAGGGACAAG 
TAAATCCTCT TACCAGTATG TTATTGAAAT 
GAAGGTGAAA GTGCTACTTG CCTGTTGCCT 
AAATGTTTAA ATTATTTTAA TTCAACAAGT 
AAGGAGTTGT GGCAATTCAT AAAATTAATA 
TTTCAATAAT AATTTCTTAT CTAAAATGTA 
AATAAACCTC AATTTCGTAG TACTAACAAC 
GAAAGTCACT CAACTCCCAC ATGTAAACAG 
GGTTTTCTAA ATTATCCCTG AATTCCTATC 
CATGTTGACC TTCACCTACA CAGATGACTC 
GCAGTAAGTT TAAGAAGCAT ACCATGCCCT 
AGCTCTTCTA CTCTTGGCCA AAGAACCTAA 
TTTGGTTTGG CTATTATAGA CAATAAATTA 
AAAAGTAAGC TCTTCTAAAG AAGTAAAATA 
TAGCTCCCAG AGCATTTACA ATTTCCCAGG 
CCTAGGCAGT GCTGATACTT TAAAAGCATT 
GGCTCACCCC CTATCCCCCA GGTATACAGT 
GAATCTTACA AGGGGGTAAT GTAGATCAGA 
ACCTCCCTAA ATTATAAATA TTTATTTTGT 
TCCTTGTATT CTGTTTATGG AGAGTACCTT 
AGATATTTCA CATCGAATCC AGCCTCAAGT 
TATTTTAAGG TAGAAATAAT ATGAAAGAAA 
TGCTGTAAAA AAGACGAATT ACCTATCAAT 
GGATAGGCTG TAAGCTCCTT CACTGAGGAC 
TCI 1 1 1 PCI I TGI 1 1 1 1 1 1 1 GAGACGGAGT 
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2301 CTTCCTCTGT TGCCCAGGCT GGAGTGCAGT GGTGCAGTCT TAGCTCACTA 
2351 CAACCTCCAC CTCCCAGGTT CAAGTGATTC TCCTGCCTCA GCCTCCCTAG 
2401 TAGCTAGGAT TACAGGTGCC CGCCACCACA CCCAGCTAGT TTTTGTATTT 
2451 TTAATAGAGA CAGGGTTTCA CCGTGTTGGA TAGGCTGTTC TTGAACACCT 
2501 GACCTCAGGT GATCTGCCTG GCTCGGCTGG AGTGCAGTGG CGTGATCTCA 
2551 GCTCACTGCA AGCTCCGCCT CCCGGGTTCA TGCCATTCTC CTGCCTCAGC 
2601 CTCCTGAGTA GCTGGGACTA CAGGTGCCCG CCACCACGCC CCGCTAATTT 
2651 TTTTGTATTT TTAGTAGAGA CGGGGTTTCA ACATGTTAGC CAGGATGGTC 
2701 TCGATCTCCT GACCTCGTGA TCCGCCCGCC TCAGCCTCCC AAAGTGCTGG 
2751 GATTATAGGC GTGAGCCACT GCGCCCGGCC AATTTACTTT TTATTTTATT 
2801 TTATTTTATT TTTTGAGACA GGGTCTTGCT CTGTTGCCCA GGCTAGAGTG 
2851 CAGTGATACG ATCTTGGCTC ACTGCAACCT CTGCTTCTCA GGCTCAACTG 
2901 ATCCTCCCAC CTCAGCCCCC AGGAGCTGGG ACTACAGGTG CATGCCACCA 
2951 TGCCCAGCTA Al 1 1 1 1 1 I IG 1 1 1 1 IAGTGC AGATGAGGTC TTGCCATGTT 
3001 GCCCAGACTG CTTAI I I I I I TCTAATCAAC TTTTGCCATA AGGACAAGTT 
3051 GCTTTCATTG AACTGAGAGT TTTTATTGGT TGCTTACTAA GTAGAAAAGA 
3101 ATATTTATTA AGACAGCTTT TTGTCACTTT TAAAAATGAT GTCTTAAGCT 
3151 GGGCATAGTG ACTCACATCT ATAATCCCAG CACTTGGGGA GGCTGAGGCA 
3201 GGTGAACTGC TTGAGCTCAG GAGTTCGAGA CCAGCCTGGG AAACATGGTG 
3251 AAACCCCATC TCTACTAAAA ATACAAAAAT TAGTTGGGCA TGGGGTATGT 
3301 ACCTGTGGTC CCAGCTACTC AGGGAGGCTG AGGTGGGAGG ATCACTTGAG 
3351 CCCTTGAGCC TCAACTTGAG GAAGTTGAGG CTGCAGTGAG CCAAGATCAG 
3401 TGCCACTGCA CTCCAGCCTG GGGCGACAGA GCAAGACTCT CTCCAAAAAA 
3451 AAAAAAAAGT CTTAAAAATA GCTGTTTTTG TTTTCCATGT TTGTTTCATA 
3501 AAI 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 TTTTGAGATA GAGTCTCGCT CTATGGCCCA 
3551 GGCTGGAGTG CAGTGGCTCA ATCTTGGCTC ACTGCAAACT CTACCTCCTG 
3601 GGTCCAAGTG ATTCTCCCGC CTCAGCCTTC CGAGTAGCAG GAATTACAAA 
3651 CGTGCGCCAC CACACCTGGC TAAI 1 1 I IAT ATTTTTAATA GAGATGGGGT 
3701 TTGACTATGT TGGCCAGGCT GGTCTTGAAC TCCTGACTTA GTGATCCGCC 
3751 TGCCTTGGCC TCCCAAAGTG CTGGGATTAC AGGCGTGAGC CACTGCGTCC 
3801 GGCCTAATTT TAAAAGTTTA AAATGGATAA I I I I IATTGG CTGTGTGTTT 
3851 CATGATTACC AGACTATGTT TCTCTCTCTT GTAGAGGTCC TTTGTTCTCC 
3901 AATGTTGCTG TCAACATTCA GTCATTTCTC CTTATTTCAC ATGGCAGCAA 
3951 ATATGTATGT TTTGTGGAGC TTCTCTTCCA GCATAGTGAA CATTCTGGGT 
4001 CAAGAGCAGT TCATGGCAGT GTACCTATCT GCAGGTAATA TGCTTTAATC 
4051 TCGGGGCCTT TGAGAGTATA AGCACTCTAA GCTATCTGCA GAACGGACAA 
4101 AGGGAATGAT TACTGCCATA TTCTACACGT AGTGAGTGCT CAGAACATAT 
4151 TTGTTTCTCA CAGTGTATGT AGAGAAGGGA GCCACAGATT GGTGGAGATG 
4201 TTGCCTTTTC TGTTCATTTT GCTGATTTCT TCTTACATAT GAATTATGTG 
4251 GGTATGTTTA ATTTTAAGTT AGGATAAACA GGCGTTAAGT AAGGGTTAGT 
4301 GTAGAATTTA AGCATGTCAT TTTTGTAATC TCATCGGGCC TTGATTTCAT 
4351 TAGTTTAGGC CCTCCATTTT ATAGATAGTG GTTCCCAGAC TTCCCGGCTG 
4401 CCTCAATCTC CTGGGTCTTT GTTAAATAAC CTTAAGCAAG CTCATTTCCC 
4451 CCAGTGTGTT CAGTTCACAG AAAGCTTTAA ATCAGAGCTA TACAATATGA 
4501 TTGTCAAGAG TGAGTTTGTT CTGTCTTCTT TGCAAGAATG TAGCAGGGAA 
4551 CCACTTCCTA GCCATGGTCT TGAAGATGGT ATCGTTTCTT ATTTCAGTTA 
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4601 GGAAATTCTC ATGCATGAAT CCAGGTCCCT AGATGCTGCT AACGTGACAG 
4651 TTGGTCAAAT TTTACTTACC TCTCTGTTTG TAAAATGTAC TTACTTAATA 
4701 CAATATAAAA ATTAATTTCT AAAATCTCTA CATTTAGAAA CAGTATATCT 
4751 GGCAGTTGTG CTGTGATGTA GTGAAAAACA CTAAGCTTGG CGATAGACCC 
4801 AGGTTCAGAT CCTATTTCTA CTACCAGCTG AGTGATGTTG CAAAAATGAC 
4851 TAAACCTCAT GATACTTACC TCCTCATGAC AAGGGGTTAA AGAAAGGACT 
4901 ACATAAAAGC ATCTACCACA AGCCCCAGAG TAGATGCTTA ATTAGTGTTC 
4951 ATCGAATACT TATGTGTATC TAGTCCTTCA AAAAAAGAAG CTGAGCATTG 
5001 TGTTTGGCTT GTAAGATAAG TGTATAGTTC TTTCCCAAGC ACTAGTTATG 
5051 TTGTAGTTAC AGAGGGTCTG TTTCAGATAC ATTAATTCCT GCTCCATAGG 
5101 AGGTTTTTAA AAATGAGCCA CGTTGACTCA AATGGCACTG AAGCCAAAGA 
5151 GACTTACGGG ATCATCCAGT CTGTTGTCCC ACCCCAGATA TTCTGATTTC 
5201 GTGTGTCTGG AGTACAGCCA GAGAATATAC TCTTGGGAAT GAGTCTTCAT 
5251 GTTATAGTTG AGGAAAATGG TAACTGAGAA GTGGAGTGAA TGACCGTGTC 
5301 GCTCAGCAGA TCATGCAGCA GGTCAGACTT TTCATCCCCT GTAAAGTCGC 
5351 TGAAATGATA GGCAGGAGAA GTATTCATGC CCGTACCCTC ACAGTGATCC 
5401 AGATTGAAAC CCGACACTGT TTATCTGTGT AGAAATCAGA AATGAAAACC 
5451 ATTTTCATGG CTGGATGTGG TGCCGCACGC CTGTAATCCC AGCTACTCAG 
5501 GAGGCTGGGG GACAAGAATA ACTTGAACCC GGTAGGCAGA GGTTGCAGTG 
5551 AGCCAAAATT GTACCACTGC ACTTCAGCAG CCGGGGCGAA AGAGTGAAAC 
5601 TCTGTCTCAA AAAAAAAAAA AAAGAAAAGA AAAAAAAAAG TAAACCATTT 
5651 TTATACCTCA CTTAAATTAT TGTAATGTGA CTTGTTTTTC AGGTGTTATT 
5701 TCCAATTTTG TCAGTTACGT GGGTAAAGTT GCCACAGGAA GATATGGACC 
5751 ATCACTTGGT GCAGTAAGTA TTTCTATTGT AAAI I I I I I I TAATTTAATT 
5801 TTTAAATTTA CTTTGAAATA AGTTTAGACT TAGAAGAATG TTGTAAAATT 
5851 GATAAGTAGG TTCTCATATA CCCTTCACCC TACTGTTAAC TAACATCGAA 
5901 ACCAAGAAAT TAACATTGAA ACAATACAGT TGACTAATTT AGAATTTATA 
5951 CATTTGTAAA GCTTTGTAAA TGTCCGGCTA TAGCTTTTAA CCATTGGTCA 
6001 TATATATATG TTTACCAGAG CAGAGTATAT CTCAGAACAG TAAGTGTGCA 
6051 ATCCTCGTAA ACCAGAGAGC CTAATCCAGT ATTGGAAGAT TCTAATTATA 
6101 GATTTGAATC TGGTACTTTA TCCTCCTATT TAGTCAATAT TGGAGTGCCT 
6151 ACTAGGTGCT ATGCTAGAGC CTGGGGATAA CAGCTGGTGA GCAAGATGAT 
6201 CACGATTATT TGTGTTGGTT TTAGAAAGTG GGGAACAACA ACAACAAAAA 
6251 AGGCTCCTGC CCTCAGAGCT CTTATATTCT GGATGCTTAA AAAAAI I I 1 1 
6301 CTTAGGCTGG ATGCAGTGGT TTACACCTGT AATCCCAGCA CTTTGGGAGG 
6351 CCAAGGTGAG AGGATGAGCC CAAGAATTCG AAACCAGCCC TGGTAACATA 
6401 CCAAGATCCT ATCTGTACAA AAAAATTTAA AAAATTAACT GGGGGTGGTG 
6451 GCTTATGCCG GTAGTCTCAG CTACTCAGGA GGCTGAGGAA GGAGGATAGC 
6501 TTGAGCCTAG GAGGTTGAGG CTGCGGTGAG CTGTGATTGT ACCACTGCAC 
6551 CCCAGCCTGG GTGACATAGC AAGACCCTAT CTCAAAAAAA AAA! 1 1 1 1 1 1 
6601 TTAAGTGTGT TTTGAGGCTG GGTGCAGTGG CTCACACCTG TAATCCCAGC 
6651 ACTTTGGGAG GCTGAGGTGG GCAGCTCACT TGAGGTCAGG AGTTCAAGAC 
6701 CAGCCTGGTC AACATGGTGA AACCCTGTCC CTCCTGAAAA TACAATAATT 
6751 AGCCAGGTGT GGTTGTGCAT GCTTGTAATC CCAGCTACTC GGGAGGCTGA 
6801 GGCAGGAGAA TTACTTGAAC CCAGCGGGTA GAGGTTGCAG TGAGCTGAGA 
6851 TTGCACCACT GCACTCCAGC CTGGGTGACA GAACAAGACC CTGTCTCACA 
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6901 GAACAAGACC CTGTCTCAAA GAAAAAAAAT I I I I I IAAGT GTCTTTTGAG 
6951 TTTAATGGCA GATTTCTGGG CACATGGAAA TCTTTATGTA ATATTTCCTT 
7001 ACACATTCAG TTTGTACTTA TTTAAATACT AATTCATTTA AATGCATTCA 
7051 AATAGGGAAT TTCCTATTTA AAGGAACTCT AAAAAGGTCA ATTTTGAAAA 
7101 GAATTCTTAT GTAAAATAAC CATTCCCTAA TTTGTATGTT CCCCAAATTT 
7151 GTTTACACTr AATTTTCCTA GTGAGGCCTG TGTTCTGTCC TGTGACCACA 
7201 TGCTTTCTTA AGCCTCCTTT TTTCCCTTCG TGGAATGTTT ATTTTCTTTA 
7251 TACAATTTCG CTCTGATATA ATTTATATAT TTCGAATCAT ATTGTCTACC 
7301 TCATTCAACA GCTAAGCACC TAATATATGA AGGCAGTGAA GACCACTAGG 
7351 ATGAATCAGA GACTCAGAAT TCGAATTTAG CTGGGGAGAA AACATGCACA 
7401 CATCTAATAC ACACTGAAAG GAATGAGGAT TCTCTAGAGG ACTTTGGGGG 
7451 CTCTAAGAGT GAAGAGACCT TTCT AATTAG CTGAAAGGAC CTGCGAGGGC 
7501 ATTTTGATGT GCTCTTGGAC AGCTGTTGTC CTCATCTTAT AGATAAGAAA 
7551 CTGAAGTGCA AACTTAATGA AGT ATGGCAG TAAGGTATTT GGAGTTAGAG 
7601 TGGGGGTGAA TCCTGGTTCT GCTACTTACG TGTGATTTCT AGGACATATT 
7651 ACTGAACTTC TCTGAATTTC AGTTTCCCTT TATAAAATGG GGATAACACC 
7701 ATCTATTTCT GAGGTGCAAA GCAAGTACAT TTAGAGTGCT TAGCACAATA 
7751 AGAAGCACAT GGTAAGAAAT GTGGACATGG TAGTTCCTGT TCAGTCATCA 
7801 AAATCCTACA GCGCCGTGGT AGGATAACAT TATCCCCAAA TATCTTAATG 
7851 AATCTGTGAT TAAAATTCAA GGAAATTAAA TCACCAGGTA TAATGGCATT 
7901 TTTAATGAGA AATCTGGGAA AAAAACACCA TTAACAAAGT TGTGTTGTTA 
7951 CAAAATGTAA AGCGTT AGTC CTCTTGGTTT AGTGAGACGT TATAAGATGC 
8001 AGGGGACAGC CAGGCACAGT GGCTCACGCC TGTAGGCCCA ACACTTTGGG 
8051 AGCCACGGCA GGAAGATCAC TTGAGCCCAG GAGGTTTGAG ACTAGCCTGG 
8101 GCAACAAAGT GAGACCCCAT CTCTACAAAA AATTTCAAAA TTAAGCCGGG 
8151 CATGGTGGCA TGCACCTGTA ATCCTACCTA CTCAGGAGAG GTGGGAGGGT 
8201 GGGAGGAATG CCTGAGCCTA GGAGGGTGAG GCTGCTGTGA GCCATGAGCA 
8251 TGCCACTGTG CTCCAACCTG GACAACATAG CGAGACCCCA TCTCAAAAAA 
8301 AAAAAAAGAA AGTTGAATGG GACTGTTAAA ATATGTTTGT AAATTACTGT 
8351 ATTGGTACTA TCCTGGATAA 1 1 1 1 lAAACT TTTCTGTAGA GACAGGGTCT 
8401 CCCTATGTTG CCAAGGCTGG TCTCAAACTC CTGGGCTCAA GTGATCCTCC 
8451 TACCTGGGCC TCCCAAAGTG TTGGGATTAC TGGTGTGAGC CACTACACCC 
8501 GGCCAATTGT LI 1 1 ICI I AT TCAAGTTGAG Al I 1 1 ICTGG TTCTTGATAT 
8551 GATGAGTGAT TTTTCAGTTG AAGCCTGATC ATTTTAGATA TGATGAGACT 
8601 TTGGATCTTA TTGAAATCTG CTGTTTCAGT GGTCTTCCTC TGACACTGTT 
8651 CTGATGAGGA GAGGGGGTGC CGTGACTCGT TACTGCTGGG TGTAGGAGTA 
8701 GACGTCCAGG TTCCTCACTC AGCCGCCTTT GCCTCCTGAG TGATAGGGGC 
8751 TCTTGTCACT GCAGGGCAGG GATGGGAGCT GAGGGGGTGC AGGCTACCTA 
8801 GTGTGCCTCT GCTAATGTCG CTGTGGCTAG GAGGAGCAAG GGTGCTTCTT 
8851 TCCGCTGACA CCGCCTGTTA GGCGTATTGG GATGCCTCAT TACAGTGTGG 
8901 CAAGGGTGGG AGTCTAGGCT CTGCTCAGCC TTTGCTGGGC ACCCGTTTCT 
8951 CTAAATATTG TCTAAAAGGT CTCTTTTGCT AGGCTATCTT 1 1 1 1 IGGTCC 
9001 TTGACTAGAG AGAACATGTT GAGGGATGAT CGATATGAGG CCAAAAGAAA 
9051 GCCCAGGGAA CTCACCACCA CAACATTGAT TGAATCTCAG GCTTCCTAGC 
9101 TGGTCCGCTT TCCTCTCTCT TCCTTTCACA GTCCTCTTAC ATTTGTTTCA 
9151 TATGTAACAC CCAGGGTCTT TAGCTGTACT TAGCTTTTGT AAGCAGAGGG 
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9201 AGCAGATTCA CTTAAATTAT AATACCAAAT AAAGTTAAAA AACATAAGTA 
9251 TGATAGATTT GAAGATTATA TAGATACAGA AAAATGTTTG TGAGCCCAGG 
9301 CGCAGTGGCT CACAACTGTA ATCCCAGCAC TTTGGGAGGC CGAGGTGGGT 
9351 GGATCACTTG AGGCCAGGAG TTCGAAACCA GCCTGGCCAA CATGGTGGAA 
9401 CCCCATCTCT ACTAAAAATA CAAAAATTAG CTGGGCATGG TGGTGTGTAC 
9451 CTGTTAGTCC CAGCTACTTG GCAGGCTGAG GTGTGAGAAT TAACTTGAAC 
9501 CTGGGAGGCG GAGGTTGCAG TGAGATCGTG CCACCGCACT CCAGTTTGGG 
9551 CAATAGCGAG ACTCTGTCTC AAAAAATATA TGTTTATGAA ATAAGTAAAA 
9601 AAAAATCAGA TGTGCATATT GATTACAGGT ATATAACCAG TACATAAAAA 
9651 TATTGATGGA GAACAAAAGA CCTTCACCTC TTCCCATGGA CCCACACCTC 
9701 TTAGGTCTGT TGGATCAGGG TTCATGACTC ACTGTACTTA AACTGTGTAT 
9751 GAATGTGAGC GTTTTCTGAG AAGAGAAGGG TTCATTTTCA TTAAATTCTT 
9801 CTTTCTGACT CGAAAAAGTG AAAAAAGTCT CTCTGCATGG GAGTAAGCCC 
9851 AAATATTTGT CAAAAAACAA GTTGTGATTT ATTCAGACAT ATAAATATTT 
9901 AAATTTATAT AAAAGCCACA TCGAGAAAAT TCTAGAAGGA TGATGGAACT 
9951 GTGTATGTAA TAATTACAAT AAGTTATAAT CACAAAAAAA CCAGCGTTCC 
14 10001 ATGGAATTGT ACAGATAACG ACAAI I I I I I TTAACAGATG GAGAATAATC 
Q 10051 ATCTATGGAA TAGTAGTTTA GAAGAACTTC ATAGAATTTT I 1 1 1 1 1 1 1 I I 
p 10101 I I I I I 1 1 I I I I 1 1 I I IGGAG AGGGAGTTTC GTTCTTGTTG CCCAGGCTGG 
*t 10151 AGTGCAAAGG TGCGATCTCG GCTCGCTACA ACCTCTGCCT CCCGGGTTCA 
|t 10201 AGCGATTCTC CTGCCTCAAC CTCCTGAGTA GCTGGGATTA CAGGCATGCA 
□ 10251 CCACCATGCC CAGCTAATTT TGTAI I I 1 1 A GCAGAGACTG GGTTTCTTCA 
j'y-' 10301 TGTTGGTCAG GCTGGTCTCG AACTCCAGAC CTCAGGTGAT CTGCCCGCCT 
10351 CAGCCTCCCA AAGTCCTGGG ATTACAGGTG TAAGCGACTG TGCCTGGCAG 
i A 10401 AACTTCATAG AATTTTAATG CTCTTTTATA TCAACTAATC AAATTATATT 
j U 10451 TGCTTCATTT TGGGGAAACG TGTAATTTTG ATTTGTTTTG GGGI I I I I I I 
^ 10501 GAGATAAAGT GTCACTCTGT CGCCCAGGCT GGAGTACAGT GGCTCAATCT 
£ 10551 TGGCTCACCA CAACCTCAGC CTTCCGAGTA GCTGGGACTA CAGGCGCCCA 
[J 10601 CCACCACGTC TGGCTAATTT TTGTGI I I I I AGTAGAGACG GGGTTTCACT 
10651 ATGTTGGCTA GGCTGGTCTT GAACTCCTGA CCTCAGGTGA TCCACCTGCC 
10701 TCGGCCCCTC AGAGTGCTGG GATTACAGGC GTGAGCCACC GTGCCCGGCT 
10751 ACAATTATAG TCTCTTGCAC AGAAGCCAGC TTGGTCAAAA TTCAGGTCTT 
10801 CTTGGGTCCT CCTTTTGAGG AGTGTTCATG CTGTCCTTCC ATCTTGCAGT 
10851 TACCCTGACT TCTAAGAATG CAACCCGAGC TTGTTTCCCT GTTGAGGCCA 
10901 CTTGGCAGTT ATATGAGGGA CTGGGGACAT CTGAGATCTC TGGGACTCAT 
10951 AATAATTTTC TTTAAAGTTT TAGTAATTCC CCAAATGTAA GATAATCTTG 
11001 TATTCTGAAG CAACCCGTCA CATAGAAGAC ATTAAGAAAA CATTGATTAA 
11051 GAGAGGTAGA TGCTATTTTC CAGAAACAAC CGI I I 1 1 ATA TGAAAAGGTA 
11101 GGAACCTTTC I 1 1 I IAATGA TAGGGGCTTC TTTCAAAAGT TATTTTGCTC 
11151 TTAGGTGTCT I I I I I I I I I I TTTAAACATC TCATTCATAA ATAATTAAAA 
11201 ACTTATGGGA AAGTTGCAGG GAATAGTACA GAGGACTCCC ATAAAGTCTT 
11251 TTTTGTTTGT TTGTTTTGTT TTGTTTTGAG ACAGAGTCTC GCTGTTTTAC 
11301 CCAGGCTGGA GTGCAGTGGG ACAATCTCGG CTCACTGCAA CCTCTGCCTC 
11351 CCGGGTTCAA GCAATTCTCG GGCCTTAGCA TCCTAAGTAG GTGGGATTAT 
11401 AAGCATCCGC CACCACGCCC AGCTAATTTT I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I IG 
11451 TAI 1 1 1 IAGT AGAGACGGGG TTTTACCACG TTGGTCAGGC TGGTCTCAAA 
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11501 CTCCTGACCT CAGGTGATCC acctgcctcg gcctccaaaa gtgctgggat 

11551 TATAGGCGAG AGCCACTGCA CCCAGCCCCA TGTAGTCTTT TTAAAAAGCA 
11601 GGCAACTCAG GTTTACTAGT TAACATGCAA AAAACTGCAC ATATTTAAAG 
11651 TTTGGTAAGC TTTGACATGT AGACACCCGT GAAACCATCA CCACACTCAA 
11701 GATCATGGAC ATATTCATCC CAAAAGCTTC CTAGTGGTCA CTCCTTCCTG 
11751 CCCCTCCTCT ACCCCTGGCG ACAACTTACC TACTTCTACT AAAGATAAAT 
11801 TAGTTTGCAA ATGGAACCAT ACAGCATATA CTAGTATTTG TTGTCCTGGC 
11851 CTCATTTACT CTGTATAATT ACTTTGAGAC TCATCCATGT TCTGTGTATC 
11901 AGTTTATTCC TTTATTATTT TTGAGACAGG GTCTTACTCT GTTGCCCAGG 
11951 CAGGAGTGCA GTGGTGCAAT CATAGCTCAC TGTAACCTTG ACCTCCTGGG 
12001 CTTAAGGGAT CCTCATGCCT CACAATGTGC TGGAATTACA GGCGTGAGCC 
12051 ACCACACTGG CAATGTTTTG TTTCTTTATG AAGATGAATA AAGATTTCAC 
12101 ATGAAI I I I I TAAGATGAAA CATGCTTCAT GCATGCAGGT TTCTTTGGGC 
12151 GTATTCATGC CCACTCCCTC TGGTTGGAGC TTTGTCAGAG AAGTGTGAGC 
12201 AG I ICI I ICC TAGGCCATAG GTGAAAGATG CGCATGACAC GOTAGCACT 
12251 GTCCTTGCGG TTCATGAGGC ACATACATCT TACTGCCCCG TAGTAAAAAT 
12301 TCAGTCTTTC CAAGCGATTA CTGTGTGAAG GACATTTAGT TCCTTCACCT 
12351 ATTATTGGGG ACATAAGTAA CTGAAAGCTT TGAAGCTTTG TGCTCACCTA 
12401 GAAATGTGCA GCATGTAAAC TTTCTAGAAA ATGTGCTGCT CTTTAGACCT 
12451 TGTAGCCACT AAGCAGTTGC ATATTGAGTT TCCCATTCTC CCTGCTGTGT 
12501 TACTTTGCAG TCTGGTGCCA TCATGACAGT CCTCGCAGCT GTCTGCACTA 
12551 AGATCCCAGA AGGGAGGCTT GCCATTATTT TCCTTCCGAT GTTCACGTTC 
12601 ACAGCAGGGA ATGTAAGTAT TTTTATGAAG TGCAGTGCTG GGGATAGTGG 
12651 TGATGTTTTT ATGTTGAGTG GGTTCTTGCC CTTAAGTTAG AAATGTCAGT 
12701 GCTGGAGCAA TCACAGTTGT GCCGCTTGTT TCTTGCTGCC TTTCAGGCCC 
12751 TGAAAGCCAT TATCGCCATG GATACAGCAG GAATGATCCT GGGATGGAAA 
12801 I I I I I IGATC ATGCGGCACA TCTTGGGGGA GCTCTTTTTG GAATGTAAGT 
12851 TTGAGTGTAA TTGATTGCTA AACTGCTTCC TTGGGTCATG CGCTCCTCCT 
12901 ACCCCAGCCT CACCCCTACC CCCCATCCCC ATGGCAGAGA CATTGAACTA 
12951 TGCAACGGAA GCAGAAGCAG GTGGGCTTGG GAGGGTGAGG AAACCTCAAC 
13001 ATGGCTTGCT TTGGGTTTAC CCAGCATACC TGGCTCATTG TAGAGACAGT 
13051 CTGTGCCTTT ACCCTACGCT TAACCTTAAG TTGCCCCAAC TGTTGGCCTG 
13101 TTATTCCCAG CCCCCTCTTA GAAGACTGCA GCCTGGCCCC CAGTCTATGC 
13151 TGACATCTTC 1 1 1 1 ICCCCT TCAGACTTTC CTGCCCTCCT CTCCCCTGCC 
13201 TGGCGTCCCA CCCTGCTACC CTGACCTCTG TCTCGCCAGT GCTATTTAGA 
13251 CATGCTGAGT TGGCGGAGCC ATTGCTCTGT ATGACTGGAG TAGAGGCCGG 
13301 TGACTGCAAA CCAATGTGGA CCACTTACTG AGTACCCGCT GTATGCAGGC 
13351 ACCAAGCTAG TTCCCTTATG TTATACTATT ACTACTCCCA TTTTACTGAT 
13401 GGGAAACTGA GGCTCAGACA TCATCTTCCC CAGGCCAAAC AGCTCTTCAA 
13451 TAGCAGAGCA GAGCTGTAAA CCCACCTCTA TAAGCCCTTT CCACCCCCAC 
13501 CACACCATAT GGAATTGGTT GCTAAACTGC TTCCTTGGGT CACAGCAAAT 
13551 GGCATTGTGG TTACAAGACC TTCCACGTGT GCTTCAAACA ATGGGGTTTT 
13601 GCCTAGACTA GTGCTTAGTA GTAACTGTAT CACGGAAACA CGGTCAGGAC 
13651 TCTTGGCGTC CATCTGATCG TGGGAGACCC GTCAGCATGA GCTGGATCCC 
13701 CTCGGGGCCT GTCTTTTCTT ACATAAATGT TGCCTTTTGC CCTTACTTGG 
13751 I I I I IATTTT GTTCCGCGAC AATGGAAAAC TTAAI I I I I I I I I I IATTAA 
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13801 AAAGAAAAAT CTATTCTGGC CAGGTGCAGT GGCTCACGCC TGTAATCCCA 
13851 GCACTTTGGG AGGCCAAGGC AGGCGGATCA CAAGGTCAGG AGATCGAGAC 
13901 CATCCTGGCT AACACAGTGA AACCCCGTCT CTACTAAAAA TACAAAAAAC 
13951 TTAGCCGGGC GTGGTGGCGG GCGCCTGTAG TCCCAGCTAC TCGGGAGGCT 
14001 GAGGCAGGAG AATGGTGTGA ACCCAGAAGG CAGAGCTTGC AGTGAGCCGA 
14051 GATCACGCCA CTGCACTCCA GCCTGGGCGA CAAAGTGAGA CTCTGTCTCA 
14101 AAAAAAAAAA AAAGAAAAAT CTATTCTAAG TGAAGCAGTT TTTCCCAGTA 
14151 GGTGGCAGAA CTAAATGCCA TTATGCCATT TATAATTTTA AGTGATTAAA 
14201 GAGGAGTAGT ATGTAGTATA TGCAAGGTCT AGCTCTAACA GCAGTGCAGT 
14251 ATAAATAGTA GAAACTGACC TGATATTACA GTATGAGAAA CATGAAGGGG 
14301 TTCTGTTTTG TGAGCTCTAA ATTTATCTTC CATGTATACT TCAAGGCTCT 
14351 TCTCCCCAGT AGAI 1 1 1 I AT TCATCTGAAC TATAATTAGG TGGCC I 1 1 1 1 
14401 CCATTCTGAA AATAATTGGA TCAAATGCAT TTTAAAGTCC AGGGTCTGAA 
14451 AGGTGGAGGA ATCCTTTCTC TTTACTGTTT CTAATTTAAA CTCCTTTTCA 
14501 TTTACTAGAT TTCAGTCATG TCCAGAATTC ATCTTTTCTA AAAGCTTTAA 
14551 TCTAGATTTA GAAATCTAAA ATCTTTTATT TAI 1 1 I I 1 1 I TCGTTGAAGT 
|.* 14601 GCCCTGATTT TGTTGGTGGT AAAGACTCCA TTAGTATCCA CTTATACATT 

□ 14651 TCCCTGACTT TGCCTCTGAC CAAACCTTAC AGTATTCACA TTGTACTGTT 

□ 14701 GCAATAATAA TAGCTAACAT ATTAATACAC TGAATATTTG CTGTGTGCCT 
14751 AAGCTAAGGA TTTAATTCTC TTAAAATCCT GTGAGGTATT TTATTTTACA 
14801 GAAAAAGAAA CTGCTTAAAG AAAGTAACTT ATCCAGGTCA CACAAGTAAC 
14851 AATTGCAGAG CTGGAGTTTC AGATGAGGGC TGGCTTGCGC TGCCGCTACA 
14901 GAAAAGAGTG CCCTAGAAAT CGGTCATCTT GCATTTCCCG ATTTTAGTTT 
14951 AGCCAAATGA AAAATTCCTT TTGGATTTAT GAGTATAATC AGACAGTATA 

K 15001 CCTGTGAAAT TAAAGTATTT GACTCTTTGC TTGAAATAAG TAGGTTAAAA 
iU 15051 AGATTTGGGT GGCCGGGCGC AGTGGCTCAC GCCTGTAATC CCAGCACTTT 
if 15101 GGGAGGCTGA GGCAAGTAGA TCATTTGAGG TCAGGAGTTC GAGACCAGCC 
;F 15151 TGACCAATAT GGGGAAACCT CGTCTCTACT AAAAATACAA AAATTAGCCG 
p 15201 GGCGTGGTGG TGCATGCCTG TAATACCAGC TACTTGGAGG CTGAGGCAGG 
• 15251 AGAATCACTT GAAGCCAGGA GGCAGAGGTT ACAGTGAGCT GAGATCACGC 
15301 CACTGCACTC CAGCCTGGGC AACAGAGCGC GACTCTGTCT AACAACAAAA 
15351 AAGATTTGGG AAAACACTTT ATTAATGAAG AGTTCCTGAC AAAGTGATTT 
15401 TTTTGGGGAG AAI I 1 1 I ATA ATTGCATTTG AATATTAGGG TGCTCCTTTT 
15451 TCTCTCATTC TAAATTCACC AGAGACTTAA GCACAGAGAA 1 1 1 1 IATTAC 
15501 ATGCCTGTTA ATTAATGTGT ATAATCAGAT TTTAACTATA TTTAGTGAAT 
15551 ATTAAGATTC AGGTACAAAT CAAGCCCTTT ATAATTAAAC ATACACATTC 
15601 AGAACATTTT TAAAATATTA AAACATTAAA CTGCTCTTCT CACCCACTCC 
15651 AAGTCAAATA GCAI 1 1 1 I IC AGTCAGGTGT CTGGGAGCTC GATGCAAGAT 
15701 AACAAAATCT GGTCTCTGCC TCAGGGAACA TGAAATCTGT TTGGGGAAGC 
15751 CAGAGCAAAA ATAAAGGTTT TAATAGCAAG CTCTCACTAA CTGCCCCTGG 
15801 AAATCCACCC CACATCCTCC AGGAAGCCTT TCTCTACCCC CAGTGCCCTC 
15851 AGGAGCTTCT CCAAGGCAGG CCCTTCCCAG AGCGCAGTGT GCTCCCCAGC 
15901 TCACAGGAGA TGCTCCCTAC ACGCTGCAGG AAAGTCCAGT GCCTGCAGCA 
15951 CAGGCTTCAG CAGCAGACTC GGGTTCTAGT CTCAGTCTGC TGATTCCTAG 
16001 TTGTGGAACC TGAGCAGGCG AAGTTACTAA ACCTCTCTGT GCGTCAGCCT 
16051 CCCAGGCTCG TTGCTTCAGG CCGCAGTTAG GCTGTGTGAA CAGGAGAGTG 
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16101 GGGATGGGAA CTAGGTATCT 
16151 CACCOTCGT ATAGTTAGGA 
16201 TAGCCATCCT GAGTCAGTGC 
16251 TCTGACCTGC GAGTGAGCTT 
16301 TGGACAAACT AC 1 1 1 C_ 1 1 IC 
16351 AAACTGATAC TTAACTTGCA 
16401 AAGTACTCAC ACACTGCGGA 
16451 CCTTTTACAT ATTGTAATAT 
16501 GTTTTAATCG GCTACATAGT 
16551 TGQGATCTGT CACCATGCTA 
16601 GTCGGACCTG GTTTGTTTTT 
16651 ACTGATTTGG AAGAACAGGG 
16701 GGACTAATGG CCCCAAAAAA 
16751 ACAGTAGTGG TGCATCTGGT 
16801 GGCTAGAGTG ACCATGGCTA 
16851 CCTCCCACTG TTTTCAGCTG 
(,4 16901 GAATGATGAT AAAGTTGTGA 
i;g 16951 ATGTGTGTGT TCTCTCTTTA 
Q- 17001 TGGTTGTTGC ATTGACAGGA 
M 17051 TCAGAACATG AAGGCGCTGG 
: P 17101 GCTTAGCCTT CACTCTTCCT 
j ;0 17151 TAAAAGTCAA GAGTATCCCC 
jjj. 17201 GTATTTTCTC AATTTTCAGG 
[. 17251 TATACAAAAA GCTATTTTCA 
U 17301 AAATTAAGAG GTAGAAAGAA 
|y 17351 GAGCCGAATT TTATCTTCTG 
M 17401 TGAAAATAGC AGATAGTGGC 
„| 17451 ATTCATGTTA CCCTCGTGAC 
U 17501 GAGTATGTTC TTCTTGAAGT 
17551 GGCTTCAAAA TGTTATACCA 
17601 AGGCCCTGAT TTCAGCCTCC 
17651 TGGGGGATGG GAATGGCGGC 
17701 TCACCAGGCA AGTGAGAACT 
17751 TTGTCCAGTA TTTGGCAGTC 
17801 TGTTTCTACT ATGATTTACA 
17851 TAAACTCGTA TCACTTCTAG 
17901 CAGCGAGGAA ACGGCACACG 
17951 GCTTTGCTCT GACATCTGCT 
18001 GTTTTCCAGA GATGGGATAG 
18051 GAGCCCAACA TTAATTCACA 
18101 AATGAAGACA TCTCTGTGTC 
18151 TCCCTGTAGC ATCTCCTGGT 
18201 TGCATGGCCT GGGGAGGCCA 
18251 GTCCACAGCT CCCTCCTGAT 
18301 TGGCTTCTCT GGGGACCCGC 
18351 GGCCCGTGGG CCTCTTGGGC 



TAAAGCGGGG CAGAGTTTGG ATGAGCGGGC 
GGAAGATGAC GGGAGGCATG GAAGCTGGGA 
TAATTCTGAC ACTTCAGAAC ATCGAGTCAG 
TCATTGACCA CTTAGAAACT ATTAGCACCT 
AGACCTGGTT GCTTCATGTC TGCGATGGGA 
GATAGTGGTG AATCAAAAGT AGTATATGTG 
GCATTCAGCC ATCGTCCCAT CCTACTTCTA 
GAAAGCTAAA CCATTTCTCG ATGTGAGTCA 
GAGTGGCATT CGATTTTAAA AATGTCAACT 
CTTACCATTT GTATGTCACA CTGTTTGAAT 
CTCCAGATGG TATGTTACTT ACGGTCATGA 
AGCCGCTAGT GAAAATCTGG CATGAAATAA 
GGAGGTGGCT CTAAGTAAAA CTGGGATTGG 
CCTTGCCGCC TGAGAGCCCC AGGAGACATC 
TGCTCCCGTC TGGAAGATGC CAGCATCTGG 
TGTCCCCCAG TCCGTGTCTT TTTAGAATGT 
AATAAAGGTT TCTATCTAGT TTGTAAGCAG 
AGGGGCCGAC ACGGCTCTGG CATTTTGCTT 
CCTGGGGAGA GTGCACCCTG AAAGGCCTGA 
TTGCCTGTCT TTGGACCCTC CAGTGCCTCT 
TGCCTCCCCC TCCCCTGGGT TGGCTGCACA 
TCTCCAGCAC AATCTGAAAT AACAGCTGCA 
AAAGGTAGTG TTTTCTGGCA GTGAGTGGCA 
GGTTTTGCTT TCTAGGTTCA ATTTGTAGAT 
GTGATTTGGG TAAATTCAGA CTTGAAATCT 
TTTGAAAGTG TTCTAATTGA AGCGTCTCAC 
TGTCGTCGTC ACAGCCCTCA CTGTTGTGGA 
TGAGAATGAC ATCTAGGAAA TGCAGTTTGA 
CATTTACAGG AGAAI I I I IA GTCTTTTGAT 
AGTCTTGCAG CTTTGTCCTG GGAGGATCGA 
TGTGGCCGAT CGGACTCAGG TTGTGTGCCG 
TTTGGAAAAG GAGTGGGAGT GGTGCCCACC 
GCATGGCAGC ACGCGCCCAG CACATAGAAA 
CTTCATATCC TTCTTCCATC AGGCTGGACT 
GTTATTCTTC CCAGGCACAG GATTCTGTTC 
GGGAGAGAGT TATCTTAGCC ATCATTTTGC 
TGGTGTAGGG GCACTGCCCA AGGTCACAAT 
AACAACTGCA ACACAGATGA GGCAAGATGC 
GAGGCTGAGT TCATAGGGAC ATTCCCTCTA 
TCGTGCTTTG GGCAGACCAG GCAAAGAGGC 
CCTGCTTTGT GACTGGGAAA AAGTTAGAAG 
CCCTAAAACC CCTCAATGCT GGAGCCTCTG 
GAACCTGGCT GTGGCCGGAG AAGCCTTGCT 
TGCCCACGAG GGTGCTTCAC TTTCTCCTCT 
GATCACTGCC TTCAAGGCCA TGCACTCCCT 
TGTGCCGCCT CCACTGGCAT CTGAAGTGTG 
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18401 GGGTACCTAG GAACATGCCG TGGCTGCCGT CTCCCTCATT CCATACACTT 
18451 CTTGAGTGGG TGCACTTGCT GAAGCCTCAG TTATCTGTGA GGATTCTGAG 
18501 CTCCAGACCC ACAGAATCTC TCTGTACTCT TAGTAAATGT GTCTACTGCA 
18551 ACACACGCAT GGTTCCAGGC TCTGGGACCA CCCCCCCGCC CTGCACAGGC 
18601 CCCTCAAATA GCACTCGGCT TAAGGAGTGA CACGAGCAAT CGGTGAAGTC 
18651 TGAAACCCGG AGCCATTCGA GATCTCCCTC TCTCGCCTCT TATTTCTAGA 
18701 ATTCAGCCCC TCAGCCTTCC CAGTGCCTGT GACTCCGTGG TGGTCCTCAC 
18751 TTCTTAGTCC CTGGACTGTT GAGCCTGTTC TTCCAGCTGG TCTCCAAAGC 
18801 AACCCTGTGC TTCTCCATAT GCCTGCCAGA GTGCTAAAAA CACGTCTGTC 
18851 ATTCCTTTGT TGTCACCTGT GAAAAACTTT TATTTATTTG AGACAGGGTC 
18901 TCTCTCTCTC TCTCTCGTCC AGGCTGGAGT TCAGTGGTGC AATCTAGATG 
18951 GTCACTACAC TCAGGGAGTT GGGGATGGCT CAGAGCTGTT AACAGAGAGG 
19001 GGACTGCCCA GGAGGACCTG CGTGAGGGGT GGGGGTGGGA TGACAAGGAA 
19051 CCAGCTCTGG GAGTTGAAAG ACCTGGATTC AAGTCTCAAC CCAAGCCCTG 
19101 GCCAGCTCTG GGACCCCGGA CAAGTCGGCC TCACTCTCTG CCCCTCAGTG 
19151 GGCTCCTGTG TAGATGGGGA TAATGATGGC TTTATATCCT GAGAATGTGG 
U 19201 GGAGGGGATT AAGTGGCCAA AATACCTGAG AGTGCGCACT CAGTGCCTGG 
p 19251 CTCAGCAAAT GCCCTTGTTC CCTCCTTCCC TCTCCCCAGA ACCCCTCCTC 
Q 19301 CCCTTCTTCT TCI 1 1 I I I I I I 1 1 I I 1 1 I I I TGACCCAGAG TCTTGCTATG 
I'j 19351 TTGCCCAGGC TGGAGTGCAG TGGCACAATC TCGGCTCACT GCAACCTCCA 
T 19401 CCTCCTGGCT TCAGGCAATT CTTGTGCCTC AGCCTCTCGA GTAGCTGGGA 
19451 TTACAGGCAG GCACCATCAC GCCCGGCTAA 1 1 1 I 1 1 I I 1 1 I I I 1 1 I I IGT 
19501 AGTAGAAATG GGATTTCACC ATATTGGCAG GATGTTCTCG ATCTCCTGAC 
19551 CTCAGGTGAT CCACTCGCCT TGGCCTCCCA AAGTGCTGGG ATTATAGGTG 
4 19601 TCAGCCACTG CGCCCAGCCC CCATTGTTTA TCTCCTCTTC CATTTCTTGT 
fU 19651 GGGGACTTTT AAAGGAAAAA TCAGGTTGGT GGGCTGGGGG AGGGCATAGC 
!* 19701 TGAGACCACC TTGAGGGCAC CAAGCTCACT GACCAC (SEQ ID NO: 3) 
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19244 


T 


C 


Beyond 


ORF(3') 



Context: 
DNA 

Position 

237 CGAGGI I I CI I (^TGTTGGTCAGGCTGGTCTCGAACTCCCGACCTCAGGTGATCCGTCCG 

CCTCAGCCTCCCAMCTACTGCTGGGATTACAGACCT 

CTTTCAI I 1 1 1 1 1 I (^TCTATTTTCCITTATTTTMTa\CTTTATCCAGAM(^TATCCT 
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CGTOTGACAGTGCTGrGGTGCCTGTGGTTTC^GMGCTGGGrGrGa'GrGTGrC 
[T,C] 

CTGJTITGAGGMGTTGCCC^ 

TCAGCCTAGTAGGCAGGATCAGGGACCCCA^ 

GAATAAMCACCAMGCCCTGACTGATCATGATCATAGCMTCCGAT^ 

GCCAGACCATTCTC^GGTCGTCTTTACCCrMGATATCAATCACTG 

GACCTMGGGTGCACTCTGGGTAGTAMGATGAT^^ 

MGGGTGCACTCTGGGTAGTAMGATGATTMCTCTCCCAAA 
AGCMCACGMTCACTGCrCTClTCCTATAGGGTAMCCTCCCMGACrC^GTCCCr^ 
GAGGAGGCTCrGCCCGCCTGCCCrrCCCAGGGTTCCAGGCTCCACATTGGGAGGT^^ 
CAGTGCTCITCGCTCTTCATTGCC^^ 

CTGTCCCTCTCACCATCTTTAAM ATAAT 
[G,T] 

CTGTCACAGTCCCTGCrAGrGAGACATCTGATACMCTGATGGMTCAGTTCAA^^ 

GC^GTAAAATTTTATTTAATGTACTACGGAGAAAGAAAAAATGCT 

CATCCTGATTTG^GATATTAAAATGGAAAAMTGTCTTAAGATCTGTG^ 

TCCTTTCCCACCrCTCAAGTGGGAGAGCAAAAACTGGACAGACTAG^ 

AGCTGAGAACCTTACAGMTGAGCMCTGCGGMGCCACAGGTAACACCGAGA^ 

CTACCAGTTATMGATGCATCCTGAT^ 
CTCTGAAAMTCTAGCTTCCTTTCCCACCT 

TAGAMTGCCAGGGGCTAGCTGAGMCCTTACAGMTGAGCMCTGCGGMGCC^ 
MCACCGAGATGTAGAT(^GCTGCCAGGGACM 

CTCTTACCAGTATGrrATTGAMTCAGTCCTTATTGGC^TCGMGMGGTGAMGTGCTA 
[C,T] 

TTGCCTGTTGCCTACAGAGACTGGAGGMTGACAAATGTTTAMTTA^ 
AGrAGAGGAATACCrGCrATCTGMGGAGTTCTGGG^TTC^TAAAATTAATATA I I I I I 
TGAAGTTTGTACTTTTCAATAATAA 1 1 Id I ATCTAAAATGTAACAAGTTAATTATATTA 
TCGMTAAACCTCMTTTCGTAGTACTM 

ACTCMCTCCCACATGTAAACAGACTTTA^ AAATTATCC 

TGGAAAAMTGTCTTMGATCTGrGAAAMTGTAGCTTCCrrrCCCACCrCT 

AGAGCAAAMCTGGACAGACTAGAMTGCCAGGGGCTAGCTGAGMCCTTAC^ 

CMCTGCGGMGCCACAGGTMCACCGAGATCT 

MTGTTTTCTAMGTAMTCCTCTTACC^^ 

CGMGMGGTGAAAGTGCTACTrGCCTGTT^ 

[-,A,T] 

TAAATTATTTTMTTCMCAAGTAGAGGAATACCTGCTATGTGM 
CATAAAATTAATATA I I I 1 1 I GAAGTTTGTACj I I I I CAATAATAA 1 II U I ATCTAAAAT 
GTMCMGrrMTTATATTATCGAATAMCCTCAATTTCGr^ 
CTTACAGAAAMGGAMGTCACTCMCTCC^ 

AGAGGI 1 1 I CTAAATTATCCCTGAATTCCTATCACATGACrA I I I 1 1 CTCAGACATGTTG 
TCAGTCCTTATTGGCATCGAAGMGGTGAMCT 

GGAGGAATGACAAATGTTTAMTTATTTTMTTCMCAAGTAGAGGMTACCT 
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GAAGGAGTTGTGGCMTTCATAAMTTAATATA I I I I I I GAAGTTTGTAG I I I ICAATAA 
TAAI I I CI I ATCrAAMTGTAAG\AGTTMTTATATTATCGAATAMCCTCAATTTCGTA 
GTACTAACMCATCAACACTTA^GAAAMG^ AAACA 
[T,C,G] 

ACTTTAGMGCAGTTGCAGAGGTTTTCTAMTTA^ 

TTTTCTCAGACATGTTGACCTTCACCrACACAGATGACn^ 

GCAGTMGTTTMGMGCATACCATGCC^ 

CTCTTGGCCAMGMCCTMTTCTGTATATTAC^ 

CMTAMTTATTGATCrGATTATMTTGAGAAMGTMGCTCTTCTAM 

392 5 GCCTTCCGAGTAGC^GGAATTACAAACGTGCGCCACCACACCTGGCTAA 1 1 I I IATATTT 

TTAATAGAGATGGGGTITTGACrATGTTGGCCAGGCTGGrCTTGAACrCCT 
TCCGCCTGCCTTGGCCTCCCAAAGT GCTGGGATTACAGGCGTGAGCGXCTGCGTCCGGCC 
TAATTTTAAAAGTTTAAAATGGATAA I I I I I ATTGGCTGTGTGTTTG\TGATTACCAGAC 
TATGTTTCTCrCTCTTGTAGAGGrCL I ITG1 I CTCCAATGTTGCT GTCAACATTCAGTCA 
[C,T] 

M TTCTCCXrATTTCACATGGCAGOWVTATGTATGrrTTOT 

□ GTGMCATTCTGGGTCAAGAGCAGrrC^TGGCAGTGTACCrATCT GCAGGTAATATGCTT 

□ TMTCTCGGGGCCTITGAGAGrATMGC^CrCTAAGCrATCT GCAGAACGGACAAAGGGA 
l "t ATGATTACTGCCATATTCTACACGTAGTGACTGCTCAGAACATA I I IGI I I CTCACAGTG 
:q TATGTAGAGMGGGAGCCACAGATTGGTGGAGATGTTGCCTTTTCTOT 

j'y 5539 ATGAGTCTTCATGTTATAGTTGAGGAAMT^ 

TCGCTCAGCAGATCATGG\GCAGGTO 
\,a TAGGCAGGAGAAGTATTC^TGCCCGTACCCTGXCAGTGATCCAGATTGAMCCC 
!'U GTTTATCTGTGTAGAAATCAGAMTGAAMCCATTTTCATGGCTGGA 
I'* GCCTGTMTCCCAGCTACTCAGGAGGCTGGGGGACMGMTMCITGAA AGGCA 

f [G,C] 

p. AGGTTGCAGTGAGCCAAMTTGTACCACTGCACTTCAGCAGCCGGGGCGAM 

!4 CTCTCTCTCAAAAAAAAAAAAAMGAAMGAAAAA 

ACTTAAATTATTGTAATGTGAC I IGI I 1 1 I CAGGTGTTATTTCCAATTTTGTCAGTT ACG 
TGGGTAAAGTTGCCACAGGA^GATATGGACCATCACTTGGTGCAGTAAGT ATTTCTATTG 
TAAAI I I I I I I I AATTTAAI 1 1 I I AAATTTACTTTGAMTMGTTTAGACTTAGAAGAAT 

7220 AGAAAAAAA A I I 1 1 I I I AAGTGTCTTTTGAGTTTMTGGCAGATTTCTGGGCACATGGAA 

ATCTTTATGTMTATTTCCTTACACATTCAGTTTGTACrTATTTA^ 
AAATGCATTCAAATAGGGMTTTCCTATTTAMGGM 

AGAATTCTTATGTAAMTMCCATTCCCTAATTTGT ATGTTCCCCAAA I I IGI I IACACT 
TMTTTTCCTAGTGAGGCCTGTGTTCTGTCCTGTGACCACATGG I MCI I AAGCCTCCTT 
[T,C] 

TTTCCCTTCGTGGAATGTTTAI I I I CI 1 1 ATACAATTTCGCTCTGATATAATTTATATAT 

TTCGMTCATATTGTCTACCTCATTCMCAGCTMGC^ 

GACCACTAGGATGAATCAGAGACTCAGMTTCGMTTTAGCT^ 

CATCTMTACACACTGAAAGGMTGAGGATTCTCTAGAGGACT^ 

GAAGAGACCTTTCTAATTAGCTGAMGGACCTGCGAGGGCATTTTGATCT 
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GAAMGAATTCTTATGTAAAATAACCATTCCCTAATTTGTATGTTCCCCAM I I Ibl I lA 
CACrTMTTTTCCTAGTGAGGCCTGTGTTCTGTCCrGTGACCA I I I <_ I IAAGCCT 
CCI 1 1 1 I I CCCTTCGTGGAATGTTTA I INCH I ATACAATTTCGCTCTGATATAATTTA 
TATATTTCGMTCATATTGTCTACCTCATTCMCAGCTM 
GTGMGACCACTAGGATGMTCAGAGACTCAGAATTCGMTTTAGCTGGGGAGAA^ 
[G,A] 

CACACATCTMTACACACrGAMGGM 

GAGTGAAGAGACCTTTCTMTTAGCTGAAAGGACCrGCGAGGGCATm 
GGACAGCTGTTGTCCTCATCTTATAG^^ 
GCAGTMGGTATTTGGAGTTAGAGTGGGGGTGAATC^ 
TTCTAGGACATATTACTGMCTTCT^ 

GGCrCTTGTCACTGC^GGGCAGGGATGGGAGCTGAGGGCGrGCAGGCT ACCTAGTGTGCC 
TCrGCTAATGTCGCTGrGGCrAGGAGGAGCAAGGGrGCI ICI I I CCGCTGACACCGCCTG 
TTAGGCGTATTGGGATGCCTCATTO^ 
GCaTTGCTGQGCACCCGTTTCTCTAMTATTGTCrAAM 

LI I 1 1 1 1 I GGTCCTTGACTAGAGAGMC^TGTTGAGGGATGATCGATATGAGGCCAAAAG 
[A,C] 

MGCCCAGGGMCTCACCACCACAAC^TTGATTGAATCTCAGGC^ CCT AGCTGGTCCGC 
TTTCCTCTCrCrrCCTTTCACAGTCCTCTTAC^ I I IGI I I CATATGTAACACCCAGGGTC 
TTTAGCTGTACTTAGC I I I I GTAAGCAGAGGGAGCAGATTCACTTAMTTATAATACCAA 
ATAMGTTAAAAM^TMGTATGATAGATTTGAAGATTATATA 
TGTGAGCCCAGGCGCAGTGGCTCACMCTGTMTCCCAGCACT^ 

ATTGATGG^GMCAAAAGACCTTCACCTCTTCCCATGGACC GTT 
GGATCAGGGTTCATGACTCACTGTACTTAAACTGrGTATG 

AGAGMGGGTTCATTTTCATTAAATTCI ICI I I CTGACTCGAAAAAGTGAAAAAAGTCTC 
TCTGOXTGGGAGTAAGCCCAMTATTTGTDWW^CMGTTGT GATTTATTCAGACATA 
TAMTATTTAAATTTATATAAAAGCCACATCGAGAAMTTCTAGM 
[T.C] 

GTATGTMTMTTACAATMGTTATMTCACAAAA 

AGATAACGACAA I I I I I I 1 1 MCAGATGGAGAATAATCATCTATGGAATAGTAGTTTAGA 
AGAACTTCATAGAA I I I I I 1 1 1 I I I I I I 1 1 1 1 I I I I I I I I I I I I GGAGAGGGAGTTTCGT 
TCI IGI I GCCCAGGCTGGAGTGCAMGGTGCGATCTCGGCTCGGTACAACCTCTGCCT 
CGGGTTCMGCGATTCTCCTGCCTCAACCTCCT GAGTAGCTGGGATTACAGGCATGCACC 

ATTTAAATTTATATAAMGCCACATCGAGAAAATTCTAGM AT 

GTMTMTTACMTMGTTATMTCACAAAAAMCCAGCGTTCCA^ 

AACGACAAI I I I I I I I MCAGATGGAGMTMTCATCTATGGAATAGTAGTTT AGAAGAA 

CTTCATAGAA 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GGAGAGGGAGTTTCG I ICI I 

GTTGCCCAGGCTGGACTGCAMGGTGCGATCTC 

[G,A,T] 

TCMGCGATTCTCCTGCCTCMCCTCCTGAGTAGCTGGGATTAC^ 
GCCCAGCTAATTTTGTAI I 1 1 1 AGCAGAGACTGGG I I ICI I CATGTTGGTCAGGCTGGTC 
TCGMCTCCAGACCTCAGGTGATCTGCCCGCCTCAG^ 

GTGTMGCGACTGTGCCTGGCAGMCITCATAGAATTTTMTGCrC I I 1 1 ATATCAACT A 
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ATOW\TTATATTTGCTTCATTTTGGGGAMCGTGTMTTTTGAI I IGI I I IGGGGTTTT 
GGMCTGTGTATGTMTMTTACMTMGTTATMTC^^ 

AATTGTACAGATAACGACAA I I I I I I I I AACAGATGGAGAATAATCATCTATGGAATACT 
AGTTTAGAAGAACTTCATAGAA I I I I I 1 1 I I I I I I I I I I I I I I I I I 1 1 I I I IGGAGAGGG 
AGTTTCGI ICI IGI I GCCCAGGCTGGAGTGCAAAGGTGCGATCrCGGCTCGCr ACAACCT 
CTGCCTCCCGGGTTCMGCGATTCTCCT^ 
[C,G] 

ATGCACCACCATGCCCAGCTAATTTTGT A I I 1 1 I AGCAGAGACTGGG I I ICI ICATGTTG 
GTCAGGCTGGTCrCGAACTCCAGACCTC^GGTGATCTGCCCGCCTCAGCCT 
CTGGGATTACAGGTGTMGCGACrGTGCCTGGCAGAACTTC^^ 
TTATATCMCTAATCAMTTATATTTGC^ 

TTTTGGGGI I I I I I I GAGATAAAGTCTCACTCTGTCGCCCAGGCTGGAGTACAGTGGCrC 

TTTCGI ICI ICI I GCCCAGGCrGGACTGCAMGGTGCGATCTCGGCTCGCrACAACCrCT 
GCCTCCCGGGTTCMGCGATTCTCCTGC^ 

TGCACCACCATGCCCAGCTAATTTTGTA I I I I I AGCAGAGACTGGG 1 1 ICI ICATGTTGG 

TCAGGCTGGTCTCGMCTCCAGACCTCAGCTGATCTGCCCGCCTCAGCCTCCCAAA 

TGGGATTACAGGTGTAAGCGACTGTGCCTG^ 

[C,T] 

ATATCAACTMTCAAATTATATTTGCTT CATTTTGGGGAMCGTGTMTTTTGA I ITGT1 
TTGGGGI I 1 1 I I I GAGATAMGTGTCACTCTGTCGCCCAGGCTGGAGTACAGTGGCrCAA 
TCTTGGCTCACCACAACCTCAGCCTTCCGAGTAGCrGGGACrACAGGCGC 
GTCTGGCTAAI I I I IGTGI I I I I AGTAGAGACGGGGTTTCACTATGTTGGCTAGGCTGGT 
CTTGAACTCCTGACCTGAGGTGAT^ 

AGAGACTGGG 1 1 ICI ICATGTTGGTCAGGCTGCTCrCGMCTCCAGACCTCAGGTGATCT 
GCCCGCCT^GCCTCCCAMGTCCTGGGATTACAGGTCTMGCGACTGTGCCT 
CTTCATAGAATTTTAATGCTLI 1 1 I ATATC^CTAATCAMTTATATTTGCn'CATTTTG 
GGGAAACGTGT AATTTTGA I I I G I I I I GGGG 1 1 I 1 1 I I GAGATAAAGTGTCACTCTGTCG 
CCCAQGCTGGAGTACAGTGGCrCMTCTrGGCrCACCAO AGC 
[T,C] 

GGGACTACAGGCGCCCACCACCACGTCTGGCTAA I I I I IGTGI I I I I AGTAGAGACGGGG 
TTTCACTATGTTGGCrAGGCrGGTCTTGMCTCCTGACCrCAGCT 
GCCCCTCAGAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCGGCTA 
CTTGCACAGMGCCAGCTTGGTCAAAATTCAGGTC I ICI I GGGTCCTCCTTTTGAGGAGT 
GTTCATGCTGTCCTTCCATCTTGCAGT^^ 

(^GCCTCCCAAAGTCCTGGGATTACAGGTGTMGCGACTGTG^ 

AATTTTAATGCTCI I I I ATATCMCTMTCAMTTATATTTGCrTCATTTTGGGGAAACG 

TGTAATTTTGAI I IGI 1 1 IGGGGI I 1 1 1 1 1 GAGATAAAGTGTCACTCTGTCGCCCAGGCT 

GGAGTACAGTGGCTCMTCTTGGCTCACCACMCCTCAGCC^ 

CAGGCGCCCACCACCACGTCTGGCTAA 1 1 1 I IGTGI I 1 1 1 AGTAGAGACGGGGTTTCACT 

[A,G] 

TGTTGGCTAGGCTGGTCTTGMCTCCTGACCTCAGGTGATC 
GAGTGCTGGGATTACAGGCGTGAOICACCGTGCCCGGCTACMTTAT^ 
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GAAGCCAGOTGGTCAAAATTCAGGrCI I CI I GGGTCCTCCTTTTGAGGAGTGTTCATGC 
TGTCCTTCCATCrTGCAGTTACCCrGACrTCT AAGAATGCAACCCGAGL I lb 1 1 ICCCTG 
TTGAGGCCACTTGGCAGTTATATGAGGGACTGGGGACATCTGAGATG"^ 



11125 TTCATGCTGTCCTTCCATCTTGCAGTT^^ 

TTCCCTGTTGAGGCG\CTTGGCAGTTATATGAGGGACrGGGGAG\TCT 
ACTCATAATAAI I I I CI I I AAAGTTTTAGTMTTCCCCAMTGTMGATMTCTTGTATT 
(TGMGCAACCCGTCACATAGAAGACATTMGAAAAC/\TT(^ 

ATTTTCCAGAAACAACCG I I I I I ATATGAAAAGGTAGGAACC I I ICI 1 1 I IAATGATAGG 
[G,A] 

GCI ICI 1 1 CAAAAGTTATTTTGCTCTTAGGTCTL I I I I 1 1 I I I I I I 1 1 AAACATCTCATT 

CATAMTMTTAAAMCTTATGGGAMGTTGCAGGGMTAGTACAGAGGA 

GTLI I I I 1 1 G I 1 1 G I I IGI I I I G I I I I G 1 1 I lGAGACAGAGTCTCGCT(ol 1 1 lACCCAGG 

CTGGAGTGC^CTGGGAGAATCrCGGCTCACTGCMCCrcrGCCT 

TCTCGGGCCrTAGCATCCTMGTAGGTGGGATTATMGCATCCGC^ 



a 

□• 



M 
i'U 

"N 
□ 



12025 



12391 



AGCTTCCTAGTGGTG\CTCCTrCCrGCCCCTCCTCTACCCCTGGC(^ 
TCTACTAMGATAMTTAGTTTGCAMTGGMCW I I ICI IGT 

CCTGGCCTCATTTACrCTGTATAATTACrrTGAGACT 

TATTCCTTTATTA I 1 1 1 1 GAGACAGGCTCTTACTCrGTTGCCCAGGCAGGAGTGCAGTGG 

TGCMTCATAGCTCACTGTMCCITGACCrCCTGGGCTTMGGGATCCT 

[A,C] 

TCTGCTGGMTTACAGGCGTGAGCCACCACACTGGCAATG I I I ICI I ICI 1 1 ATGAAGAT 
GAATAAAGATTTCACATGAAI I I I I I MGATGAMCATGCTTCATGCATGCAGC I I ICI I 
TGGGCGT ATTCATGCCC^CTCCCrcrGGTTGGAGCTTTGTCAGAGMCT 
TTTCCTAGGCCATAGGTGAAAGATGCGCATGACACGCTTA 

GAGGCACATAG^TCTTACTGCCCCGTAGTAAAMTTC^GTCrrT G 



AAGATTTCACATGAA I I I I I I AAGATGAAACATGCTTCATGCATGCAGC I I ICI I IGGGC 

GTATTCATGCCCACTCCCrCrGGTTGGAGCTTTGTCA I ICI I ICC 

TAGGCCATAGCTGAMGATGCGCATGACAC 

ACATACATCTrACTGCCCCGTAGTAAAMTTCAGTCTTTCCAAG^ 

GACATTTAGTTCCTTCACCTATTATTGGG 

[T,G] 

GCTCACCTAGAMTGTGCAGCATGT^ 

GTAGCCACTMGCAGTrGCATATTGAGTTTCCCATTCTCCCT 

CTGGTGCCATCATGACAGTCCTCGCAGCTGTCTGCACTMG^ 

CCATTATTTTCCTTCCGATGTTCACGTTCACAGCAGG I I 1 1 I ATGAAGT 

GCAGTGCTGGGGATAGTGGTGATC I I I 1 1 ATGTTGAGTGGC I ICI I GCCCTTAAGTTAGA 



13001 GCTGGAGCAATCACAGTTGTGCCGC I ICI I ICI I GCTGCCTTTCAGGCCCTGAAAGCCAT 
TATCGCCATGGATACAGCAGGMTGATCCTGGGATGGAAA I 1 1 1 1 I GATCATGCGGCACA 
TCTTGGGGGAGCTC 1 1 1 1 IGGAATGTMGTTTGAGTGTMTTGATTGCTAMCTGCTTCC 
TTGGGTCATGCGCTCCTCCTACCG^^ 
CATTGMCTATGCMCGGMGCAGMGC^GGTGGGCrrGGGAGGOT 
[A,G] 
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TGGCTTGCXITGGGTTTACCCAGCATACCT^^ 

CCCTACGCTTMCCrTMGTTGCCCCAACTGTTGGCCTGrrATTCCCAG^ 

AAGACTGCAGCCrGGCCCCCAGTCTATGCTGACATL I I CI I I I I CCCCTTCAGACTTTCC 

TGCCCTCCTCTCCCCTGG^ 

CTATTTAGACATGCrGAGTTGGCGGAC^ 

MGTTTGAGTGTMTTGATTGCTAMCTGCTTCCITGGCT 
GCCT<^CCCCTACCCCCC^^^ 
GG^GGTGGGCTTGGGAGGGTGAGGAMCCT^ 
TACCTGGCTC^TTGTAGAGACAGTCT^ 

CAACTGTTGGCCTGTTATTCCCAGCCCCCTCTTAGMGACTGC^GCCT 
[A,G] 

TGCTGACATL I I LI 1 1 1 1 CCCCTr^GAOTTCCrGCCCTCCTCTCCCCTGCCTGGCGrC 
CCACCCTGCTACCCTGACCTCTGTCT^ 

GCCATTGCTCTCTATGACTGGAGTAGAGGCCGGTGACTGCAMCCMTGTGGACC^ 

CTGAGTACCCGCTGTATGCAGGC7\C^^ 

CC^TTTTACTGATGGGAAACTGAGGCrCAGACATCATCTTC^ 

GGAGTAGAQGCCGGTGACrGCAAACCAATGTGGACCACTTACTGACT 

AQGCACCMGCTAGTTCCCmTGTTATACTATTACrACrCCCA^ 

CTGAGGCTCAGACATCATCTTCCCCAGGCCAMCAGCTCT 

TAMCC(^CCTCTATMGCCCTTTCCACCCCCACCACACCA^ 

CTGCTTCCTTGGGrCACAGCAA^TGGCATTGrGGTTACM 

[A,G] 

A<^TGGGGTTTTGCCTAGACTAGTGaTAGTAGTMCT 

GACTCTTGQCGTCCATCTGATCGTGGGAGACCCGTCAGCATGAGCTG^ 

CCTGTLI I I ILI I ACATAAATGTTGCCTTTTGCCCTTACTTGG I 1 1 1 IAI I I IGI ICCGC 

GACAATGGAAAACTTAA I I I I I I I I I I I ATTAAAMGAAAAATCTATTCTGGCCAGGTGC 

AGTGGCTCACGCCTGTMTCCCAGCACTTTGGGAGGCCAAGGC^ 

ACTACTCCCATTTTACTGATGGGAMCTGAGGCTCAGACAT 

AGCTCTTCMTAGCAGAGCAGAGCTGTAM 

CACACCATATGGMTTGGTTGCTAMCTGCTTCCTTa 

TTACMGACCirCCACGTGTGCTTCAAAC^ATGGGG I I I I GCCTAGACTAGTGCTTAGTA 

GTMCTGTATCACGGAMCACGGTCAGGACTCTTGGCGTCCATCr 

[T,G] 

TCAGCATGAGCTGGATCCCCTCGGGGCCTGTL 1 1 I ILI I ACATAAATGTTGCCTTTTGCC 
CTTACTTGGI 1 1 I IAI I I IGI I CCGCGACAATGGAAAACTTAA I 1 1 I I I 1 1 I I IATTAAA 
AAGAAAAATCT ATTCTGGCCAGGTGOVGTGGCTCACGCCTGTMTCCCA^ 
GGCCMGGCAGGCGGATCACMGGTCAGGAGATCGAGACCATCCTGGC^ 
ACCCCGTLTCTACTAAAMTACAAAAMCTTAGCCGGGCGTGGTGGCG^ 

CTTGCACTCAGCCCAGATCACGCCACT 

TCTOWWWWWWV^GAAAMTCTATTCTAAGTGAAGCAGI 1 1 1 I CCCAGTAGGTGG 

CAGAACTAMTGCCATTATGCG\TTTATMTTTTMGTGAT^ 

GTATATGCMGGTCTAGCTCTMCAGCAGTGCAGTATAM 
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TTACAGTATGAGAMC^TGMGGGGTTCTGTTTTGTGAGCTCTAAATTTA 
[A,G] 

TACTTCAAGGCTCTTCTCCCG\GTAGA I I 1 1 1 ATTG\TCTGAACTATAATTAGGrGGCCT 
TTTTCCATTCTGAAMTMTTGGATCAMTGCATTTTAM 

AGGAATCCTTTCTCTITACTGTTTCTMTTTAAACrCC. 1 1 I I CATTTACTAGATTTCACT 
(XTGTCCAGMTTCATCTTTTCTAAM 

TATTTAI 1 1 1 1 1 1 1 1 CGTTGAAGTGCCCTGA I ITTG1 I GGTGGT AAAGACTCCATTAGTA 

ATTTTAMGTCCAGGGTCTGAMGGTGGAGGMTCCTTTCTCTT^ 
MCTCCTTTTC^TTTACTAGATTTCAGTCATGT CCAGMTTCATCTTTTCTAAMGC1TT 
MTCTAGATTTAGAMTCTAAMTCTTTTATTTA I I I I 1 1 I I I CGTTGAAGTGCCCTGAT 
1 1 IGI I GGTGGTAMGACTCCATTAGTATCCACTTA^ 
ACOWVCCTTACAGTATTCACATTGTACrGTTG^ 
[A,G] 

CTGAATATTTGCTGTGTGCCTMGCTMGGATTTMTTCTCTTAAMTCCT 
TTTATTTTACAGAAAMGAMCTGCTTAMGAMGTM AA 
CMTTGCAGAGCTGGAGTTTCAGATGAGGGCT^ 
GCCCTAGAMTCGGTCATCTTGCATTTCCCGATTT^ 

TTTGGATTTATGAGTATAATG\GACAGT ATACCTGTGAAATTAAAGTATTTGACTCTTTG 

GTMCTTATCCAGGTCACACMGTMCMTTGCAGAGCrGGAG^ 
CTTGCGCTGCCGCTACAGAAAAGAGTGCCCTAGAMTCGGTCATCTTGGVTl^ 
TTAGTTTAGCCAAATGAAAAATTCC I I I IGGATTTATGAGTATAATCAGACAGTATACCT 
GTGAAATTAMGTATTTGACTCTTTGCTrGAMTMGT^ 

CGGGCGGAGTGGCTCACGCCTGTMTCCCAGC^CrrTGGGAGGCTGAGGCMGTAGATG\ 
[C,T] 

TTGAGGTCAGGAGTTCGAGACCAGCCTGACCAATATGGGGAMCCTCGTCT 
ATACAAAMTTAGCCGGGCGTGGTGGTGCATK 
GGCAGGAGMTCACTTGAAGCCAGGAGGCAGAGGTTACAGTGAGCT^ 
GCACTCCAGCCTGGGCMCAGAGCGCGACTCTGTCTMCAACA 
CACrTTATTAATGAAGAGTT CCTGACAAAGTGA I I I I I I IGGGGAGAAI I I I I ATAATTG 

TTTTTAAMTATTAAMCATTAAACTGCTClTCrCACCC^ 

TTTCAGTCAGGTGTCTGGGAGCTCGATGCAAGATAACAAMTCTGGTCTCT 

MCATGAMTCTGTrrGGGGMGCCAGAGOWWVTAAAGGI I I I AATAGCAAGCTCTCA 

CTMCTGCCCCTGGAMTCCACCCCACATCCTCCAGGAAGCCriTCTCT 

CCTCAGGAGCTTCTCCMGGCAGGCCCTTCCCAGAGCGCAGTOT 

[A,G] 

AGATGCTCCCTACACGCTGCAGGAMCTCCAGTGCCTGCAGCACAGGaTCAGC^ 

CTCGGGTTCTAGTCTCAGTCTGCT^ 

TAAACCTCTCTGTGCGTCAGCCTCCCAGGCTCGTT^ 

GMCAGGAGAGTGGGGATGGGMCTAGGTATCTTAMGCGGGGCAGAGTT^ 

GGCCACCCTTCGTATAGTTAGGAGGMGATGACGGGAGGCAT^ 

GCGTCAGCCTCCCAGGCTCGTTGC1TCAGGCCGCAGTTAGGCTGTGTGAACAG 
GGGATGGGAACTAGGTATCTTAMGCGGGGCAGAGTTTGGATGAGCGGGCCACCOT 
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ATAGTTAGGAGGMGATGACGGGAGGO\TGGAAGCTGGGATAGCCATCCrGACTCAGTGC 
TMTTCTGACACTTCAGMC^TCGAGTCAGTCTGACCTGCGAGrGAGCT^ 
CTTAGAAACTATTAGCACCTTGGACAAACrAC. 1 1 IU 1 1 CAGACCTGGTTGCTTCATGTC 
C-.G.T] 

GCGATGGGAAMCTGATACTTMCrrGCAGATAGTGGTGMTCAAAAGrAGTATATGTGA 
AGTACTCACACACTGCGGA(KATTCA^ 

TTGTAATATGAAAGCrAAACCATTTCTCGATGTGAGTG^GI I I I AATCGGCTACATAGTG 
ACTGGCATTCGATTTTAAAMTGT^ 

TATGTCACACrGTTTGAATCTCGGACCTGGI 1 1 U I I I 1 CTCCAGATGGTATGTTACTTA 

16786 TCTCGATGTGACTCAG I I I I AATCGGCTAG\TACTGAGTGGC^TTCG^TTTTAAAAATGT 
CAACTTGQGATCTGTCACCATGCTACrrACCATTTC 

ACCTGGI I IGI 1 1 I I CrCCAGATGGTATGTTACTTACGGTCATGAACrGATTTGGAAGAA 

CAQGGAGCCGCTAGTGAAMTCTGGCATGAMTMGGACTMTGGCCCGWVAMGGAGG 

TGGCTCTMGTAAMCTGGGATTGGACAGTAGTGGTGCATCTGGrC 

[G,C] 

i4 CCCCAGGAGACATCGGCTAGAGTGACCATGGCTATGCTCCCGTCTGGAAGATGCCAGCAT 
Q CTGGCCTCCCACTGTTTTCAGCTGTGTCCCCCAGTCCGTGTL I I 1 1 I AGAATGTGAATGA 

Q TGATAAAGTTCTGAAATAAAGGTTTCTATCTAGTTT GTAAGCAGATGrGTGTGTTCTCTC 
!•* TTTMGGGGCCGACACGGCTCTGGCATTTTGCTTTGG I IGI I GCATTGACAGGACCTGGG 

"H GAGAGTGCACCCTGAAAGGCCTGATCAGAACATGAAGGCGCTGGTTGCCTCT 

in 

|;3 17159 TGTTTTCAGCTGTGTCCCCCAGTCCGTGTCI I I I I AGAATGTGAATGATGATAAAGTTGT 

GAMTAMGGTTTCTATCTAGTTTGTAAGCAGATGTGTGTGTTCTCT 
1 A ACACGGCTCTGGCATTTTGCTTTGG I IGI I GCATTGACAGGACCTGGQGAGAGTGCACCC 

|U TGAMGGCCTGATCAGAACATGAAGGCGCTGGTTGCCTGTCT 
I. J CTGCXTAGCCTTCACTCTTCCTTGCCTCCCCCTCCCCTGGG 
P [G,A] 

t 3 AGAGTATCCCCTCTCCAGCACMTCTGAAATAACAGCTGCAGTATTTTCT 
i * GAAAGGTAGTGI I I I CTGGCAGTGAGTGGCATATACAAAMGCTATTTTCAGG 1 1 I IGCT 

TTCTAGGTTCMTTTGTAGATAMTTMGAGGTAGAMGMGTGATTTGG 
ACTT6AMTCTGAGCCGAATTTTATCTTCTGTTTGAM 
CTGAAAATAGCAGATAGTGGCTGTCGTCGTCACAGCCCrCACrGTTGTGGAAl^ 

17976 AAMGGAGTGGC^CTGGTGCC^ 

CCCAGC^CATAGAMTTGTCCAGTATTTGGCAGTCCTTG\TATCC I IU I CCATCAGGCT 

GGACI rGTl I CTACTATGATTTACAGTTATTOTCC(^GGCACAGGATTCTGTTCrAAAC 

TCGTATCACrTCTAGQQGAGAGAGTTATCTTAGCCATC^^ 

ACACGTOGTUTAGGGGCACTGCCCAAGGTCACMTGCTTTGCT^ 

[-,T,C] 

TGCAACACAGATGAGGCAAGATGCGI I 1 1 CCAGAGATGGGATAGGAGGCTGAGTTCATAG 

GGACATTCCCTCTAGAGCCCAACATTMTTCACATCGTGC^ 

AGGCMTGMGACATCTCTGTGTCCCTGCTTTGTGACTGGGAAAMGTTA 

TAGCATCTCCTGGTCCCTAAMCCCCTCMTGCTGGAGCCTCTGTGCATGGCCT 

GCCAGMCCTGGCTGTGGCCGGAGAAGCCTTGCTGTCCACAGCTC^ 
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TCACCAGGCMGT^GAACTC^ 

TTTGGCAGTCCTTCATATCL I I CI I CCATCAGGCTGGAC I I CI I I CTACTATGATTTACA 

GTTATTCTTCCCAGGCACAGGATTCTGTTCTAMCTCGTA^ 

TATCTTAGCC^TCATTTTGCCAGCGAGGAMCGGC^CACGTGGTGTAGGGGC^ 

AGGTCAO\ATGCT7TGCTCTGACATCrGCTMO 

[G,A] 

TTTTCCAGAGATGGGATAGGAGGCTGAGTITCATAGGGACA^ 

TMTTCACATCGTGC1TTGGGCAGACCAGGCAAAGAGGCAATGMGA CTGTGTCC 
CTGCTTTGTGACTGGGAAAMGTTAGMGTCCCTGTAGC^TCTCCT GGTCCCTAAAACCC 
CrCAATGCTGGAGCCTCTGT GCATGGCCTGGGGAGGCCAGAACCTGGCTGTGGCCGGAGA 
AGCCTTGCTGrCCACAGCrCCCTCCTGATTGCCCACGAQGGrGCTrC^ 

GCATQGCAGCACGCGCCO^GCACATAGAAATTGrCCAGTAT^ 

TTCTTCCATCAGGCTGGALI l(jl I I CTACTATGATTTACAGTTATTCTTCCCAGGCACAG 

GATTCTGTTCTAMCTCGTATCACITCTAGGGGAGAGAGTTATCTTA^ 

(^GCGAGGAMCGGCAGVCGTGGTGrAQGGGCACrGCCCAAGGTCACMTGCTTT 

GACATCTGCTMCAACTGCMCACAGA^ 

[G,T] 

AGGCTGAGTTCATAGGGACATTCCCrCTAGAGCCCMCATTMTTC^ 
GCAGACCAGGCAMGAGGCAATGAAGAG^TCrCTGrrGTCCCT GCTTTGTGACTGGGAAAA 
AGTTAGMCTCCCTCTAGCATCTCCrGCTCCCTAAMCCCCTCMTCCTGGAGCCTCT^ 
GCATGGCCTGGGGAGGCCA^ 

CCrCCrGATTGCCCACGAGGGTGClTCAClTTCrCCTClTGGCrrCT 

CATGGCAGCACGCGCCCAGCACATAGAMTTGTCCAGTATTTGGCAGTC(^ 
TCTTCCATCAQGCTGGAC I IGI I I CTACTATGATTTACAGTTATTCTTCCCAGGCACAGG 
ATTCTGTTCTAMCrCGTATCACTTCTAGGGGAGAGAGTTAT 
AGCGAGGAMCGGCACACGTGGTGTAQQGGCACTGCCCMGGT<^CMTGC^ 
ACATCTGCTAACAACTGCAACACAGATGAGGG\AGATGCG I 1 1 I CCAGAGATGGGATAGG 
[A,G] 

GGCTGAGTTCATAGGGACATTCCCTCTAGAGCCCMCATTMTTCACATCCTGCITT^ 
CAGACCAGGCAMGAGGCMTGMGACATCTCTGTGTCCCTGC^ 
GTTAGMGTCCCTGTAGCATCTCCTGGTCCCTAAAACCCCTCMTGCrGGAGCCTCT 
CATGGCCTGGGGAGGCCAGMCCT^^ 

CTCCTGATTGCCC^CGAGGGTGCrrCACTTTCrCCT CTTGGCTTCTCTGGGGACCCGCGA 

ACATAGAMTTGTCCAGTATTTGGG^GTCCTTCATATCL I I CI I CCATCAGGCTGGACTT 

GTTTCTACTATGATTTACAGTTATTCTTCCCAGGC^ 

CACTTCTAGGGQ\(^GAGTTA^ 

GGTGTAGGGGCACTGCCCMGGTCACAATGCTTTGCrCTGAG^TCT 

CACAGATGAGGCAAGATGCGTTTTCCAGAGATGGGATAGGAGGCTGAGTTC^ 

[T,G] 

TCCCTCT AGACXCCAACATTMTTCACXTCCT 
TGMGACATCTCTGTGTCCCTGCTTTGTGACTGG 
CTCCTGGTCCCTAAMCCCCTCAATC 
ACCTGGCTCTGGCCGGAG^ 
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TGCTTCACTTTCT CCTCrrGGCrrCTCTGGGGACCCGCGATCACTGCCTTG\AGGCG\TG 

18375 GCrTTGGGG\GACCAGGCAMGAGGGV\TGMGACATCTCTGTGTCCCr 

GGGAAAAAGTT AGAAGT CCCTGTAGCATCTCCTGGTCCCTAAAACCCCTCAATGCTGGAG 
CCT CTGTGCATGGCCTGGGGAGGCCAGMCCTGGCTGTGGCCGGAGMGCCT^ 
ACAGCTCCCT CCTGATTGCCCACGAGGGTGCTTCACTTTCTCCTCTTGGCT^ 
ACCCGCGATG^CTGCCTTCMGGCCATGCACTCCCTGGCCCGTGGGCCT CTTGGGCTGTG 
[C,T] 

CGCCTCCACTGGCATCTGMGTGTGGGGTACCrAGGA^CATGCCGTGGCTGCCGTCT 
TCATTCCATACACTTCTTGAGTGGGTGCACTTGCrGMGCCT 

CTGAGCTCCAGACCCACAGMTCTCrCTGTACTCTT AGTAMTGTGTCTACTGCAACACA 
CGCATGGTTCCAGGCTCTGGGACCACCCCCCCGCCCTGG^CAGGCCCCTCAMTAGC^ 
CGGCTT MGGAGTGACACGAGOV\TCGGTGMGTCTGAMCCCGGAGCCATTCGAGATCT 

19244 CTAGATGGTCACTACACrCAGGGAGTTGGGGATGGCTCAGAGCTGTTAA 
CTGCCCAGGAGGACCTGCCTGAGGGGTGGGG 
TTGAAAGACCTGGATTCMGTCTGVVCCCAAGCCCrGGCCAGCTCTG 
GT CGGCCT CACT CT CT GCCCCTCAGTGGGCTCCTGTGTAGATGGGGATMTGATGGCTTT 
ATATCCTGAGMTGTGGGGAGGGGATTMGTGGCGAAAATACCTGAGAGTGCGCACTCAG 
[T,C] 

GCCTGGCTCAGCAMTGCCCTrGTTCCCTCCTTCCCTCTCCCCAGAACCCCTCCTCCCCT 
TLI ILI ILI 1 1 I I I 1 1 I I I I 1 1 I 1 1 I I GACCCAGAGTCTTGCTATGTTGCCCAGGCTGGA 
GTGG^GTGGCACAATCTCGGCrCACTGCMCCTCGACCTCCTGGCTTCAGGCMTTCTTG 
TGCCrCAGCCTCrCGAGTAGCTGGGATTAC^GGCAGGG^CCATCACGCCCGGCTMTTTT 
1 1 I I I 1 1 1 1 1 1 I I I GT AGTAGAMTGGGATTTCACCATATTGGCAGGATGTTCTCGATCT 

Chromosome map: 

Chromosome 3 
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